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Abstract. The LISA Parameter Estimation (LISAPE) Taskforce was formed in 
September 2007 to provide the LISA Project with vetted codes, source distribution 
models, and results related to parameter estimation. The Taskforce's goal is 
to be able to quickly calculate the impact of any mission design changes on 
LISA'S science capabilities, based on reasonable estimates of the distribution 
of astrophysical sources in the universe. This paper describes our Taskforce's 
work on massive black- hole binaries (MBHBs). Given present uncertainties in 
the formation history of MBHBs, we adopt four different population models, 
based on (i) whether the initial black-hole seeds are small or large, and (ii) 
whether accretion is efficient or inefficient at spinning up the holes. We compare 
four largely independent codes for calculating LISA'S parameter-estimation 
capabilities. All codes are based on the Fisher-matrix approximation, but in 
the past they used somewhat different signal models, source parametrizations and 
noise curves. We show that once these differences are removed, the four codes give 
results in extremely close agreement with each other. Using a code that includes 
both spin precession and higher harmonics in the gravitational-wave signal, we 
carry out Monte Carlo simulations and determine the number of events that can 
be detected and accurately localized in our four population models. 
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1. Introduction 

These proceedings report results on merging massive black-hole binaries (MBHBs) 
obtained by the LISA Parameter Estimation (LISAPE) Taskforce. The LISAPE 
Taskforce was established at the September, 2007, LISA International Science Team 
(LIST) meeting at ESTEC, under the auspices of the LIST's Working Group lb (Data 
Analysis). The LISAPE Taskforce was charged with developing a set of vetted tools 
for quickly estimating EISA's science reach for various mission configurations and 
gravitational-wave (GW) sources. 

The initial impetus for creating our Taskforce arose because several research 
groups had independently written codes to calculate EISA's capabilities to extract 
the parameters of MBHBs [ll[a[3lll[3[i[2l[i[S[10l[Tll[12l[ll[14], but these 
groups' published results appeared on their face to be discrepant. However, the various 
groups also used somewhat different approximations in their signal models, as well as 
somewhat different assumptions for the LISA noise curve (unfortunately, there has 
never been an "official" LISA noise curve). Therefore it was unclear whether the 
differing results simply reflected these different assumptions, or whether they were 
due to bugs in one or more codes. 

The Taskforce's first goal was to resolve that question. As described in Sec. [H it 
turned out that there were no bugs: the differing results were primarily due to using 
different noise curves, and especially different low-frequency cut-offs. The second goal 
was to arrive at vetted parameter-estimation codes that the LISA Project could safely 
use in its work, e.g., in helping set EISA's sensitivity requirements, as set forth in the 
evolving LISA Science Requirements Document ^15j . 

In addition, the Taskforce wanted to develop a set of models for the distribution of 
MBHB events in the universe: event rates as a function of the black-hole masses, spins, 
and redshift, based on well-motivated assumptions regarding the birth and growth 
history of massive black holes in the universe. The goal was to generate realistic 
source distributions, which wc could "feed" to our vetted parameter-estimation codes 
in Monte Carlo fashion, arriving at realistic ensembles of LISA observations and 
associated parameter-estimation accuracies. Our hope is that in the future other 
LISA researchers will use these same MBHB ensembles when evaluating LISA science 
performance, so that different researchers will be comparing "apples to apples" . 

Of course, today our ignorance concerning MBHB birth and growth is rather 
humbling [TBI Ull [HI- It therefore behooves us to consider a range of plausible 
assumptions. In the end, the Taskforce settled on four representative source 
distributions, constructed from four merger tree models. The four merger-tree models 
arise from two choices for astrophysical inputs: (1) whether the masses of the inital 
"seed" black holes are small or large, and (2) whether accretion is efficient or inefficient 
at spinning up the massive black holes. Many more models could of course be 
developed, but the Taskforce felt that these four do very broadly sample the range of 
possibilities, and so give useful insight into how much EISA's science reach depends 
on the distribution of MBHBs in the universe. 

The organization and procedures of the LISAPE taskforce were modelled on those 
of the quite successful Mock LISA Data Challenge (MLDC) Taskforce (see, e.g., [19]). 
Membership is open, we have group telecons roughly every two weeks, and the telecon 
minutes and other Taskforce documents are posted on our wiki: 



http: / /www. tapir. caltech.edu/dokuwiki/lisape:home 
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It turned out that the creation of our Taskforce was quite timely, since by May, 
2008, we were in a position to contribute significantly to a "descope exercise" that 
the LISA project carried out in that month at NASA's behest. The results of that 
exercise are summarized in R. Stebbins's contribution to this volume. 

The organization of this paper is as follows. In Sec. [2] we describe the four merger- 
tree models that we use, and the underlying astrophysical assumptions. In Sec. [3] 
we describe the inter-code checks that we did to validate our parameter-estimation 
codes. All these codes calculate expected parameter uncertainties via the Fisher- 
matrix formalism [20j . Such results are accurate only up to corrections of order 
where p is the signal-to-noise ratio (SNR). However they are much faster to implement 
than more accurate Markov Chain Monte Carlo methods, and so more suitable for 
exploration of a large parameter space. For our results, presented in SecSj we used the 
parameter-estimation code developed by the Montana/MIT group, since their model 
waveforms already incorporated both higher harmonics and spin-orbital precession 
effects. In Sec. [S] we summarize our conclusions and discuss briefly the planned future 
work of the LISAPE Taskforce. 

2. Massive black- hole merger trees 

The cosmological evolution of massive black holes can be determined by following the 
merger history of dark-matter halos and of the associated black holes by cosmological 
Monte Carlo realizations of the merger hierarchy from early times until the present in 
a ACDM cosmology (Hq = 70 km Mpc~\ flu = 0.3, flA = 0.7). The simulations 
provide the masses, redshifts and spins of merging black holes. 

Two important sources of uncertainty in merger-tree models of black-hole 
formation are (i) the formation mechanism and mass of the first "seed" black holes, and 
(ii) the details of how accretion causes black holes to grow in time (see e.g. [T6l [TTI [18] V 
To bracket these uncertainties we focused on four representative models. 

As a representative model with "light" black-hole seeds we considered the 
Volonteri-Haardt-Madau ( VHM) scenario [21] , where light seed black holes of mscod ~ 
few hundred Mq are produced as remnants of metal-free stars at redshift z > 20. In 
an alternative scenario, "heavy" seeds with mgocd ~ 10^ Mq are formed as the end- 
product of dynamical instabilities arising in massive gaseous protogalactic disks in the 
redshift range 10 < z < 15. To allow for the possibility of heavy seeds, we considered 
a variant of this scenario proposed by Begelman, Volonteri and Rees ([H], henceforth 
BVR) . Both models (light and heavy seeds) can reproduce the AGN optical luminosity 
function in the redshift range 1 ^ -z ^ 6, but they result in different coalescence rates 
of MBHBs and hence in different GW backgrounds [17] . 

To bracket uncertainties in the growth of black holes by accretion, in both 
the "light seed" (VHM) and "heavy seed" (BVR) cases we considered two different 
accretion models. We adopted either the standard "prolonged accretion" scenario, 
where accretion of material with constant angular momentum axis rapidly spins up the 
holes [23l [24] , or a "chaotic accretion" scenario [25] , where accretion always proceeds 
via very small and short episodes, caused by fragmentation of the accretion disc where 
it becomes self-gravitating. Since counter-rotating material spins black holes down 
more efficiently than co-rotating material spins them up, and it is quite unlikely for 
mergers to produce rapidly spinning holes [26j . the chaotic accretion scenario implies 
that black- hole spins are typically rather small [27] . 

For all four models (with heavy/light seeds and efhcient/chaotic accretion), the 
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spin resulting from individual merger events was determined using a semianalytical 
fitting formula based on numerical relativity simulations of the merger process |28) . 
This formula provides the modulus of the final spin given the binary's mass ratio 
q = M1/M2 and the initial spins jj {i = 1,2): jj^j^Kg , jj^ | , jjjl , cos a , cos/3 , COS7). 
The three angles cosa = ii ■ j2, cos/3 = ■ L and C0S7 = ■ L, where a hat 
denotes unit vectors, describe the orientation of the initial spins relative to the binary's 
orbital angular momentum L (see p8] for details). The merger-tree simulations do 
not provide information on the source position in the sky and on the orientation of 
the binary's angular momentum. However, by averaging over different merger trees 
we can reasonably assume that all angles are isotropically distributed in the sky. 

We further assume that spin alignment is not efficient, so that the spin directions 
at merger are isotropically distributed. This assumption may be violated if mergers 
commonly occur in gas-rich environments, and if the torque exerted by the gas is 
efficient in producing alignment of the black hole's angular momenta, as suggested 
in (see for more details). 



2.1. Details of the merger-tree implementation 

A merger tree traces the merger history that leads to a z = dark-matter halo in a 
hierarchical cosmology. For each formation scenario, our sample is based on A^troc 10 
different halo masses [3D] that span the masses of interest (from a dwarf galaxy to a 
cluster of galaxies) . Each merger tree is characterized by the mass of the parent halo 
at 2; = and its Press-Schechter weig ht W^l^ (fc = 1, . . . , iVtico) [311132], which is used 
to scale the results to the (comoving) number density of sources. Furthermore, for 
each halo mass a different number Nj.^^^ of realizations of its merger history have been 
performed, to take into account cosmic variance. Typically, large-mass halos have a 
smaller Press-Schechter weight (inherent in the adopted cosmological model) and a 
smaller number of realizations (due to computational burden). 

For each tree k we produced data files listing black-hole masses, spins and redshifts 
corresponding to "branches" of the tree where a merger event occurs. All quantities 
in these files are measured in the source frame, at variance with the convention used 
in the MLDC (recall that M = (1 + z)Afsource)- Including all merger trees and all 
realizations of each merger tree, in a typical model such as VHM we have at least 
~ 5 X 10* merging events, but many of these events will not be detectable by LISA. 
Once we choose a criterion to select detectable binaries, e.g. by requiring the SNR 
to be larger than some threshold value p > pth in ^ one-year observation time, the 
number of events at a given redshift z per comoving volume is 



k=l j=l -^rcal 

where H{x) is the Heaviside step function, and in practice we choose pth = 10. The 
number of observable events at z = per unit time and redshift is then given by 

2 

(2) 



—— = i-KcNcomiz) 

dzdt 



Dl{z) 



(1 + ^) 



where Dl is the luminosity distance [33j . 
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3. Code Comparisons for Massive Black Holes with Higher-Harmonic 
Corrections 

The importance to GW astronomy of higher-harmonic corrections to post-Newtonian 
(PN) waveforms for massive black-hole inspirals has been recognized long ago [31 H] , 
but recently there have been several papers trying to quantify this effect more 
precisely [9l [TOl [TTl [131 [M] . Though the qualitative conclusions of all these papers were 
similar, the quantitative estimates were different. This has to do with differences in the 
nature of waveforms used in the analysis (time versus frequency domain), the different 
noise power spectral densities (PSDs) employed and different ways of truncating the 
PN waveforms. The main goal of the exercise presented in this section is to bring 
together the results of various groups and compare them using common values for the 
masses, the duration of the signal in the LISA band, the noise PSD and the lower 
frequency cut-off of the detector. 

To this end it was decided that each group would analyze four dif- 
ferent non-spinning MBHB sources. The sources were described by the 
following parameters: individual masses (mi,m2), co- latitude and longitude 
{&Si4's)j the polarization and inclination angles (V', t) or the orbital an- 
gular momentum variables {6L,(j)L), the luminosity distance Dl, the or- 
bital phase at coalescence tpc and the instant of coalescence tc- The 
given parameter sets were (mi/M©, m2/M0, 6*5, (^s, t/j, i, 6*^, (^l, </?c7 ^'i/Gpc) = 
(3 X 10^ 10^,0.39845, 5.10773, 2.41199, 2.77508, 2.20726, 2.85098, 4.04657, 25.8416) for 
case 1, and (3 x 10^, 10^ 1.62168, 0.920401, 1.608812, 0.949798, 2.570387, 0.977606, 
2.697954, 25.8416) for case 2. Note that all angular variables are given in radians, 
and that the chosen luminosity distance corresponds to a redshift z — 3. It was de- 
cided that each group would use waveforms which were correct to 2PN order in both 
amplitude and phase, and for each source we assumed one year of data. The time of 
coalescence is = 1 year in cases la and 2a; in cases lb and 2b, tc = 1.05 years, so 
that the merger occurs after the end of the data set. We assumed common MLDC [41] 
conventions for the angles, and we adopted a common list of physical constants. 

More importantly, all groups agreed to use the noise curve that was used in the 
second round of the MLDC [35- There are two components to the noise model: 
instrumental noise {S„) and confusion noise from the galactic foreground (<S'conf)- For 
our comparison exercise, the (sky-averaged) instrumental noise is 



Snif) 



^ ' 2 V/* 



Sp 



10- 



/ 



(27r/)4 



(3) 



where / is in Hz, i = 5 x 10^ m is the armlength, S'p = 4 x 10^^^ m^ Hz^^ is the 
(white) position noise level, 6*0 = 9 x lO^'^" m^ s"** Hz~^ is the white acceleration noise 
level (assumed equal to the red acceleration noise level) and /* = c/[2ttL) is the LISA 
arm transfer frequency. Note that the term (///*)^/2 in Eq. ([3]) is not strictly due 
to an increase in shot noise at high-frequency. Rather LISA's response to GWs falls 
rapidly for / > /* , and in the low-frequency approximation adopted in our parameter 
estimation codes, this effect is typically accounted for by adjusting the noise curve in 
this fashion. 

The galactic confusion noise is estimated by direct simulation. A Nelemans et al. 
[551 population synthesis code was used to produce a catalog of galactic binaries 
with periods shorter than 2 x lO'' seconds. The catalog contained 2.6 x 10^ detached 
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binaries and 3.4 x 10® interacting binaries. The LISA response to this foreground was 
computed [37] and added to simulated instrument noise. A Bayesian model selection 
technique was then used to determine which systems could be individually identified 
and regressed from the data [37j . The residual was then smoothed using a Savitzky- 
Golay smoothing filter to produce an estimate for the galactic confusion noise. The 
confusion noise used in the comparisons is given by a piecewise fit to the residual for 
a two-year nominal lifetime without interacting binaries: 



5'conf(/) = < 
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-44.62 J 
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-50.92 J 


-4.4 


10 
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10 


-89.68 J 


-20 










(/<io-3), 

(10-3 < / < 10-2-^), 
(10-2-7 < / < 10-2-4), 
(10-2-4 < / < 10-2), 

(/>10-2), 



(4) 



where / is in Hz. While there were differences between the way waveforms were 
treated, the main difference between the various groups was the analytical noise curve 
that was used. Once a common noise curve was agreed upon, most of the disagreement 
between groups disappeared. A final tidying up of the slight disagreements between 
waveform models produced a concrete verification of the results. 

In the next few sections, we outline the different waveform approximations that 
were used. Two groups used time-domain waveforms, and two groups used frequency- 
domain waveforms. To begin with, it seemed reasonable to make comparisons between 
either time-domain or frequency-domain based codes. We then compared between the 
time- and frequency-domain results. The main differences between waveform models 
are summarized in Table [TJ 



Group 



Table 1. Comparisons of the different methods employed in the analysis. All 
waveforms used a 2 PN phase approximation for the study. 



AISSV 



MM 



CP 



TS 



Domain : 
Amplitude: 

Spin : 

Spin Precession 



Frequency 

Full PN expansion 

and truncation. 

No 

No 



Time 

MLDC taper, 
Hann window. 
Yes 
Yes 



Time 

MLDC taper. 

No 
No 



Frequency 

Full PN expansion. 

No truncation. 

Yes 

No 



3.1. Time- Domain Waveforms 

Two independent time-domain codes were used for the analysis. The first was used 
by Cornish and Porter (CP) T3] and is based on a 2PN waveform in both phase and 
amplitude, derived by Blanchet, Iyer, Will and Wiseman [38j. This code uses analytic 
expressions for the evolution of both the orbital phase and frequency. The code 
developed by the Montana/MIT group (MM) [5S] is based on comparable- mass Kerr 
inspiral waveforms as developed by Apostolatos, Cutler, Sussman and Thorne |40j. 
This code used coupled ordinary differential equations to evolve the spin and spin- 
precession equations. For this study, the spin is set to zero. In order to dampen 
ringing in the Fourier domain, both waveforms use the standard MLDC hyperbolic 
truncation function [41j . This function begins to truncate the waveform at an orbital 
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separation of 7M. The MM code also uses a Hann window to prevent leaking of higher 
harmonics into each other in the Fourier transform. 

Table 2. Fractional differences in the claimed parameter-estimation accuracy 
between the CP and MM models. We list the SNR and the accuracy in (from left 
to right): individual masses, time to coalescence, luminosity distance, sky angles, 
orbital inclination, polarization angle and orbital phase at coalescence. Note that 
for each quantity AA, the value in the Table should be multiplied by 10~*. 



Case 


A(SNR) 


Aln(mi) 


A ln(TO2) 


Aln(t,) 


Aln(Z?i) 


A cos 6s 


A0S 


Al 


All; 


Aipc 


la 


1.25 


5.61 


5.84 


6.27 


175.07 


14.57 


30.09 


248.35 


125.69 


57.29 


lb 


1.73 


2.90 


2.89 


3.55 


32.75 


0.48 


3.61 


41.29 


37.28 


3.18 


2a 


18.49 


19.99 


20.05 


74.95 


566.11 


365.62 


90.57 


800.23 


385.82 


146.92 


2b 


0.77 


0.97 


0.98 


0.58 


3.25 


1.04 


1.08 


1.95 


8.55 


0.79 



In Table [2] we show the difference in the claimed parameter-estimation accuracies 
between the two methods. We can see that this difference is typically of the order of 
a percent, and sometimes much less than that: for example, in case la the fractional 
difference between codes in calculated mass accuracy is ~ 6 x 10"''. 

3.2. Frequency-Domain Waveforms 

The two groups using frequency domain codes were Arun, Iyer, Sathyaprakash, Sinha 
and Van Den Broeck (AISSV) HU] and Trias and Sintes (TS) [ni [HI- While 
both groups were using the Stationary Phase Approximation (SPA) to obtain 
frequency-domain waveforms, the treatment of some aspects of waveform generation 
was different. The SPA amplitude of the kth harmonic is composed of a ratio of 
PN series. The AISSV group expanded the ratio of these series and truncated at 
the required 2 PN order. They also used an analytic expression to calculate the 
starting frequency of the waveform in the detector. The TS group expanded both 
PN amplitude series as was necessary, but did not truncate the ensuing ratio. They 
also used a numerical inversion of the PN series for t{f) at t = to find the starting 
frequency. Their code allowed for non-zero spins, but not for spin precession. For 
code comparisons, the spins were set to zero. 

Table 3. Fractional differences in the claimed parameter-estimation accuracy 
between the AISSV and TS models. The parameters we consider are time to 
coalescence, chirp mass, luminosity distance, sky and orbital angular resolution, 
and orbital phase at coalescence. The angular resolution variables are defined by 
AQi = 27r[i;*''S'^'* - (S*<*)^]i/2, where S'^ correspond to the elements of the 
variance-covariance matrix, and i = S, L. Note that again, values in the Table 
should be multiplied by 10~^. 



Case 


Ate 


Aln(Mc) 


Aln(D£) 






Aipc 


la 


3.87 


16.40 


8.35 


1.04 


0.46 


13.18 


2a 


10.22 


24.81 


0.00 


4.69 


1.34 


7.08 



In Table [3] we compare the two groups' results. For this exercise, the AISSV 
group agreed to use the same PN truncation of the amplitude as the TS group. We 
can see that in all cases the results differ by less than 1%. 



Massive Black Hole Binary Inspirals 



8 



3.3. Comparison Between Time- and Frequency- Domain Waveforms 

In initial comparisons between time- and frequency-domain codes we observed 
discrepancies in the SNRs. A visual comparison was made by inverse Fourier 
transforming the frequency domain waveforms and comparing with the time-domain 
waveforms. It turned out that the definition of the phase at coalescence is slightly 
different in the time and frequency domains. In order to match waveforms, a slight 
correction of the phase at coalescence was needed in the frequency-domain waveforms. 

Another aspect affecting the results was the fact that both time-domain models 
used a truncation function to smoothly reduce the signal to zero at the end of the 
inspiral phase. As this truncation function was also affecting the comparison between 
the time- and frequency-domain approaches, it was decided that an upper frequency 
cutoff of 0.1 mHz would be applied to case la, and an upper frequency cutoff of 1 
mHz would be applied to case 2a. This nullified the effect of the time-domain taper. 
In Tabled] we compare the four sets of results from the MM, AISSV, modified AISSV 
and TS entries. We can see that there is excellent agreement between all groups. 

Table 4. Parameter-estimation accuracies for Case 2a for both time- and 
frequency-domain codes, assuming an upper frequency cutoff of 10~^ Hz. We 
present results for the MM, TS and AISSV groups. The AISSV* entry is the code 
modified to match the TS formalism. 



Group 


MM 


TS 


AISSV 


AISSV* 


SNR 


101.2 


100.9 


99.41 


100.8 


Aln{Mc) 


4.651 X 10-5 


4.671 X 10" 


"5 4.678 X lO"'^ 


4.659 X lO-'^ 


Aic(s) 


28.17 


27.14 


27.41 


27.15 


AHDl) 


0.0673 


0.0674 


0.0677 


0.0675 


Ans(deg2) 


17.55 


17.56 


17.53 


17.52 


AnL(deg2) 


17.05 


17.13 


17.57 


17.29 




0.1225 


0.1211 


0.1213 


0.1202 



Finally, only the MM time-domain model had the ability to include at the same 
time higher harmonics, spins and spin precession. While no comparison was possible 
with the other groups taking part in this exercise, the MM code has been tested 
by comparison with an MLDC code written by Babak using the Kidder model to 
evolve spins [351 US] • Because only the MM model had a working Fisher matrix, the 
comparison was done by generating waveforms for both models and then subtracting 
them from each other. This test yielded very small residuals and confirmed the validity 
of the MM treatment of spins and spin precession. 

4. Results 

In the foregoing sections we discussed the merger-tree models used in this work (Sec. (2) 
and described the cross-validation of the different parameter-estimation codes (Sec. [3]). 
Here we will present the source distributions resulting from our merger-tree models, 
and use them for Monte Carlo simulations of LISA observations of MBHBs. One 
purpose of this exercise was to evaluate the impact of different design choices on 
LISA'S ability to accurately measure and localize MBHBs. Therefore, for each source 
distribution we computed parameter accuracies using two different LISA noise curves: 
the "baseline" and "6-link" noise models, described below. 
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Figure 1. The distribution of MBHB mergers as a function of redshift z (top 
panels) and (logarithmic) mass ratio log(g) (bottom panels) for large-seed models 
(left) and small-seed models (right). Thick (black) histograms refer to efficient 
accretion, thin (red) histograms to chaotic accretion. 

4-1- Source distributions 

To assess what science LISA can do with MBHB mergers it is essential to understand 
the distribution of sources as a function of redshift, masses and spins of the component 
black holes. We evaluated parameter accuracies for the four different merger models 
described in SecjH We shall call the models SE (small seeds, efficient accretion), SC 
(small seeds, chaotic accretion), LE (large seeds, efficient accretion) and LC (large 
seeds, chaotic accretion). 

For large-seed models most mergers occur in the redshift range 3 ^ z ^ 8, with 
a peak around z 5 (Fig. [H top left panel). In the case of small seeds, mergers are 
roughly uniform in z over the range A ^ z ^ 15, with a peak around z ~ 12 (Fig. 
[U top right panel). Large-seed models are likely to produce more symmetric binaries 
(which produce larger SNR for fixed total mass). In contrast, small-seed models lead 
to more asymmetric systems (see Fig. [1] bottom panels). Thus, although small-seed 
MBHB mergers could occur more frequently, a smaller fraction would be observed by 
LISA due to their smaller total mass and less symmetric mass ratios. 

Spin precession usually improves parameter-estimation accuracies [Sj |8]. LE 
models are the only ones that produce binaries in which both black holes generally 
have large spins. In SE models the spin is large for the more massive black hole. 
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but it is often negligible for the smaller hole. In chaotic accretion models (LC and 
SC) spins are always negligible. For a more detailed discussion of spin evolution and 
observational implications, see [57]. 



4-2. Baseline and 6-link noise models 

To evaluate the impact of different LISA design choices on the missions science 
performance, for each source distribution we computed parameter accuracies using 
two different LISA configurations and associated noise curves: a "6-link" configuration, 
which allows the construction of all three independent TDI channels, and a "baseline" 
configuration of 4 links, producing a single Michelson channel. The instrumental noise 
Sn is similar for the baseline and 6-link configurations, save for the location of the low 
frequency "wall" , which is at 10"** Hz for the baseline and 3 x 10^^ Hz for the 6-link 
model. The instrumental noise in the Michelson A and E channels is given by 



1 



12L2 



(2 + k)Sp 



l + k + P 



-Sa 



1 



/lo- 



(5) 



where k = cos(///*). Here Sp = 3.24 x 10" 



(2^/)4 "[ ' V ./ 
-22 ^2 Hr^-i jg slightly different from 
the value adopted in Eq. ([3]), corresponding to the baseline currently adopted by the 
LIST, and the second term in square parentheses is the pink acceleration noise level. 
The confusion noise estimates S'conf(/) used in the analysis of MBHB merger trees, 
unlike those for the comparison exercise described in Sec. [31 did include interacting 
binaries. The confusion noise for the baseline is slightly higher than for the 6-link 
configurations as the latter is a factor of ~ 2 more sensitive, and has a five- versus 
three-year nominal lifetime. The total (instrumental plus confusion) noise spectral 
densities for these models are plotted in Figure [2 




Figure 2. Baseline and 6-link noise models (including both instrumental and 
confusion noise) used in the merger-tree analysis. 



4-3. Parameter accuracies 

LISA'S parameter-estimation capabilities were calculated using the MM code to 
compute the covariance matrix. As discussed in Sec. [3l we have cross- validated their 
code against other codes finding excellent agreement. The advantage of the MM code 
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is that it incorporates not only high-order amphtude corrections, but also precessional 
effects due to spin. However, even the MM code does not take advantage of the merger 
and ringdown parts of the signal. Therefore, the parameter accuracies obtained in this 
work probably underestimate LISA's potential performance. 

The statistics of detectability and parameter accuracies are based on a Monte 
Carlo simulation in which we randomly selected 1024 events with the distributions 
discussed in Sec. 14.11 We eliminated from the sample all binaries with total (locally- 
measured) mass larger than 3 x lO^M©, since a) the code often crashed for such high 
masses, b) these cases represented a very small fraction of the population, and c) 
their inspirals take place almost entirely outside the LISA band, and so would be very 
poorly localized in any case. We then calculated 2048 covariance matrices (1024 for 
each of the two LISA noise models) on JPL's Cosmos supercomputer. About two 
percent of the jobs did not finish within a preset 3 hour period, and some 0.1% of the 
simulations had failures; such cases were simply discardecjj. 

Table 5. For each merger-tree model (SE, SC, LE and LC) we list: the total 
number A'^ of MBHB events in LISA's past light cone in a one-year observation; the 
number of events Af^et detectable with SNR larger than 10 in one year; the number 
for which the error in the luminosity distance Dj^ is 10% or less; the number 
that is localizable within 1 and 10 deg^ {^i deg^ S'^d ^lodeg^i respectively); the 
number that can be resolved to within 10 deg'^ with less than 10% errors in 
(-'^lOdeg^ io%D )' ^^'^ same for 1 deg^ in angular resolution and 1 % error 
in Dj^ (^idog2 i%Di^)- Results for the 6-link model are followed by those for the 
baseline model (within parentheses). 



Model 


N 


A^dot 


Nio%Dl 


A^lO dcg2 


A^10dcg2. W%Dl 


Al dcg2 


^ldog2, 1%Dl 


SE 
SC 


80 
75 


33 (25) 

34 (27) 


21 (8.0) 
17 (4.4) 


8.2 (1.5) 
6.1 (0.4) 


7.9 (1.1) 
5.5 (0.4) 


2.2 (0.6) 

1.3 (0.1) 


1.7 (0.1) 
1.3 (0.1) 


LE 
LC 


24 
22 


23 (22) 
21 (19) 


21 (7.7) 
14 (4.3) 


10 (0.8) 
6.5 (0.5) 


10 (0.7) 
5.4 (0.5) 


2.2 (0.1) 
1.8 (0.04) 


1.2 (0.05) 
1.0 (0.1) 


For 


our 


four merger 


-tree models (as listed 


in column 1 of Table [S]) we 


first give 



the total number N of mergers in LISA's past light-cone during a one-year observation 
(column 2) and the number of events Adet that arc detectable in one year with SNR 
larger than 10 (column 3). The rest of the columns show the number of observable 
events with: error in of 10% or less (column 4); angular resolution Ails < 10 deg^ 
(column 5); Afig < 10 deg^ and error in of 10% or less (column 6); Ailg < 1 deg^ 
(column 7); Afig < 1 deg^ and error in of 1% or less (column 8). 

LISA will detect quite a good fraction of the mergers which occur in the universe: 
almost all mergers will be detected in large-seed scenarios, and nearly half of all 
mergers in small-seed scenarios. This is because large seed black holes lead to more 
massive MBHBs with mass ratio q close to one, so they can be seen out to larger 
redshifts. Since the number of mergers in LISA's past light cone is three times larger 
in the case of small seeds, the number of detectable events will be similar in both cases. 
Table [5] shows that the number of detectable events (about 30 per year in small-seed 

X Our covariance matrices are computed in 15 dimensions. Because of large covariances between 
parameters (often close to ±1) the Fisher information matrix is ill-conditioned, and we believe our 
occasional failures to arise when numerical inversion of the Fisher matrix fails within the available 
numerical precision. We will explore the cause in more detail in the future. 
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models, and about 20 per year in large-seed models) seems to be quite insensitive to 
the details of accretion and spin evolution. The number of detectable events is slightly 
smaller for the baseline model than for the 6-link model. 

Measurements of Dl accurate within 1% would allow us to determine the dark 
energy equation-of-state parameter w to better than 1%, but this would not be possible 
if the accuracy in Dl were of order 10% [10]. The typical error in Dl due to weak 
lensing is in the range 5-10% at z ~ 2 [HI [45]. Ideal MBHB sources to do precision 
cosmology ("standard sirens") would be in the redshift range (say) 0.5 < z < 0.8, 
where lensing errors would presumably be of order 2-3%. Therefore, an accuracy 
in Dl of about 1% is needed if we want our standard sirens to be limited only by 
lensing, and not by the random errors due to LISA's instrumental noise. The temporal 
coincidence with any prompt electromagnetic counterpart may offer the best chance 
of identifying a unique host galaxy, especially if we can locate the source within one 
(or a few) square degrees [i5] . 

Table clearly shows that with the merger-tree models and gravitational 
waveforms used in this study, both the baseline design and the 6-link model have 
the potential for precision cosmology. However, the 6-link configuration performs 
significantly better than the baseline design. With the 6-link configuration LISA can 
get the luminosity distance to within 10% for the majority of detectable events. This is 
not really surprising, since the luminosity distance does not have a strong correlation 
with the phasing parameters and our detection threshold is pth = 10. However, only a 
fourth to half of all events can be resolved within a 10 deg^ error box. Once an event 
has a good sky resolution it will also have a good accuracy in Dl, as evidenced by 
the similarity of the numbers in columns 5 and 6: this is due to the high correlation 
between luminosity distance and angular parameters of the source (see e.g. [HIS])- 

The fraction of events that can be localized to within 1 deg^ is far smaller. A few 
events can be localized to such accuracy using one-year observations and the 6-link 
noise curve, while the statistics are too low to determine if events could be localized 
to such accuracy with the baseline design. Finally, while we have considered only 6- 
link and 4-link LISA configurations, it is possible that LISA will operate with 5 links 
for some substantial time. The important point is that for MBHB mergers, LISAs 
sensitivity and parameter extraction accuracy with 5 links should be nearly as good 
as with 6 [47]. Therefore it is quite plausible that ~ 1 event per year could have 
angular resolution and accuracy in luminosity distance good enough to measure the 
dark-energy equation-of-state parameter w [lOj . 

5. Conclusions 

We compared four largely independent codes for calculating LISA's parameter- 
estimation capabilities. All codes are based on the Fisher-matrix approximation, but 
in the past they used somewhat different signal models, source parametrizations and 
noise curves. We showed that once these differences are removed, the four codes give 
results in extremely close agreement with each other. 

Using a code that includes both spin precession and higher harmonics in the 
GW signal, we presented a preliminary exploration of LISA's ability to do precision 
cosmology using four different merger-tree models and two different sensitivity curves. 
For the merger history of galaxies we used either "small" seed black holes (msccd 
IOOM0) or "large" seeds (mseed ~ IO^Mq). In each case we adopted two different 
accretion scenarios (efficient or chaotic accretion), thereby giving us four models for 
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the birth, growth and spin evolution of massive black holes. We then employed a 
carefully tested tool to compute the parameter accuracies for events that LISA can 
detect with an (amplitude) SNR of at least 10. 

Our study shows that LISA has good potential for carrying out precision 
cosmology. There will be about 20 sources with a modest distance measurement (to 
within 10 %) and about 10 sources with a modest sky resolution (10 deg^). Even more 
interestingly, each year LISA may observe a few sources with excellent sky resolution 
(1 deg^) and luminosity distance measurements (to within 1%). Such accuracies are 
good enough to measure the dark-energy equation-of-state parameter w to within a 
percent, the dominant source of uncertainty being errors in the luminosity distance 
from weak lensing of GW events [HI |45] . 

The waveforms used in our study included important physical effects, such as 
high-order PN corrections in amplitude and phase and precessing spins. However 
our waveforms were incomplete in one respect: they did not include the merger 
and ringdown parts of the signal. Future work will include ringdown either in 
a phenomenological way (by estimating the final black hole's parameters from the 
inspiral parameters, and then using the ringdown measurement formalism developed 
in [48]) or by "stitching" PN inspiral waveforms to a set of numerical relativity 
waveforms. As shown recently by Babak et al. [49] (see also [50]) the inclusion of 
the merger and ringdown signal is expected to improve parameter estimation in two 
ways: by enhancing the signal-to-noise ratio and by increasing the signal bandwidth. 
Future studies will need to explore the contribution of merger and ringdown signals 
to detection rates and parameter-estimation accuracy. They should also explore a 
broader class of merger-tree models to get a more realistic evaluation of LISA's ability 
to do precision cosmology. 

The LISAPE Taskforce is presently developing tools similar to those described 
in this paper to explore measurement accuracies from extreme mass-ratio inspirals 
(EMRIs) of compact objects into massive black holes. This work will be reported 
elsewhere. 
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